Spectrum modeling



The invention relates to modeling a target spectrum by determining filter
parameters of a filter which has a frequency response approximating the target spectrum.

P. Stoica and R.L. Moses, Introduction to Spectral Analysis, Prentice Hall,
New Jersey, 1997, pp. 101-108, disclose parametric methods for modeling rational spectra. In
general, a moving-average (MA) signal is obtained by filtering white noise with an all-zero
filter. Owing to this all-zero structure, it is not possible to use an MA equation to model a
spectrum with sharp peaks unless the MA order is chosen 'sufficiently large'. This is to be
contrasted with the ability of the auto-regressive (AR), or all-pole, equation to model
narrow-band spectra by using fairly low model orders. The MA model provides a good
approximation for those spectra which are characterized by broad peaks and sharp nulls. Such
spectra are encountered less frequently in applications than narrow-band spectra, so there is
somewhat limited engineering interest in using the MA signal model for spectral estimation.
Another reason for this limited interest is that the MA parameter estimation problem is
basically a non-linear one, and is significantly more difficult to solve than the AR parameter
estimation problem. In any case, the types of difficulties in MA and ARMA estimation
problems are quite similar.

Spectra with both sharp peaks and deep nulls cannot be modeled by either AR
or MA equations of reasonably small orders. It is in these cases where the more general
ARMA model, also called pole-zero model, is valuable. However, the great initial promise of
ARMA spectral estimation diminishes to some extent because there is yet no well-established
algorithm from both theoretical and practical standpoints for ARMA parameter estimation.
The 'theoretically optimal ARMA estimators' are based on iterative procedures whose global
convergence is not guaranteed. The 'practical ARMA estimators' are computationally simple
and often reliable, but their statistical accuracy may be poor in some cases. The prior art
discloses two stage models, in which first an AR estimation is performed and thereafter an
MA estimation. Both methods give inaccurate estimates or require high computational effort
in those cases where the poles and zeroes of the ARMA model description are closely spaced
together at positions near the unit circle. Such ARMA models, with nearly coinciding poles
and zeroes of modulus close to one, correspond to narrow-band signals. In both methods, the
estimation of the zeros translates to a non-linear optimization problem.

An object of the invention is to provide less complicated ARMA spectrum
modeling. To this end, the invention provides a method and a device for modeling a target
spectrum, a method of encoding an audio signal, a method of decoding an encoded audio
signal, an audio encoder, an audio player, an audio system, an encoded audio signal and a
storage medium as defined in the independent claims. Advantageous embodiments are
defined in the dependent claims.

In a first embodiment of the invention, the spectrum to be modeled is split into
a first part and a second part wherein the first part is modeled by a first model to obtain
auto-regressive parameters and the second part is modeled by a second model to obtain
moving-average parameters. The combination of the constituent processes provides an accurate
ARMA model. The splitting is preferably performed in an iterative procedure. In a method
according to the invention, a non-linear optimization problem may be omitted.

The invention provides an ARMA model estimation that is suitable for a
real-time implementation. The invention recognizes that AR or MA models are not always
sufficiently accurate or parsimonious in conveying the information of the power spectral
estimate. On a logarithmic scale, with Linear Predictive Coding (LPC) methods (all-pole
modeling) peaks of the function are usually well modeled but valleys are under-estimated.
The reverse occurs in an all-zero model. In audio and speech coding, which is a preferred
field of application of the invention, a logarithmic scale is more appropriate than a linear
scale. Therefore, a good fit to the power spectrum on a logarithmic scale is preferred. The
model according to the invention gives a better trade-off between complexity and accuracy.
The error in this model can be evaluated on a logarithmic scale.

In a preferred embodiment of the invention, the second modeling operation
comprises the step of using the first modeling operation on a reciprocal of the second part of
the target spectrum. In this embodiment, only one modeling operation needs to be defined,
wherein the auto-regressive parameters are obtained by modeling the first part of the
spectrum and the moving-average parameters are obtained by modeling a reciprocal of the
second part of the spectrum by the same, i.e. the first, modeling operation. Although less
preferred, it is also possible to use a second modeling operation that yields moving-average
parameters on the second part and, to obtain auto-regressive parameters, to use the same second
modeling operation on a reciprocal of the first part of the spectrum.



WO 01/89086 PCT/EP00/04599



The invention is preferably used in parametric modeling of a noise component
in an audio signal. The audio signal may comprise audio in general like music, but also
speech. Besides the advantages mentioned above, an ARMA model according to the
invention has the further advantage that for an accurate modeling of the noise component fewer
parameters are necessary than would be the case in full AR or MA modeling with a
comparable accuracy. Fewer parameters means better compression.

Although the invention is preferably used in parametric modeling of a noise
component in an audio signal, the invention may also be used in noise suppression schemes,
in which an estimate of a noise spectrum is subtracted from a signal.

In the prior art methods according to Stoica and Moses, computational burden
exists in matrix inversions. Further, it is unclear to which value the order of the AR model
should be set, except that it needs to be high for zeros close to the unit circle. Therefore, the
computational complexity is difficult to assess. In the method according to the invention,
computational burden exists in the iterative nature of the splitting process and the
transformation to the frequency domain (Stoica and Moses calculate primarily in the time
domain). The invention provides better results in case of zeros close to the unit circle.
Furthermore, the transformation to the frequency domain opens the possibility of
manipulations. An example is to make the split frequency dependent on the basis of a priori
or measurement data. Another advantage is the applicability to warped frequency data, as is
explained below. In order to guarantee real-time ARMA modeling, a fast transformation to
the frequency domain should be applied, e.g. Welch's averaged periodogram method which
is well known in the art.
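As an illustration of the frequency-domain transformation mentioned above, the following is a minimal sketch of an averaged periodogram in the style of Welch. The function name `welch_psd`, the Hann window, the 50 % overlap and the naive DFT are assumptions made for this example only; the document does not prescribe them, and a real-time implementation would use an FFT.

```python
import cmath
import math

def welch_psd(x, seg_len, overlap=0.5):
    """Averaged periodogram on a uniform grid of seg_len frequencies.

    Assumptions of this sketch: periodic Hann window, 50 % overlap by
    default, and a naive O(N^2) DFT to keep the example dependency-free.
    """
    hop = max(1, int(seg_len * (1.0 - overlap)))
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / seg_len) for n in range(seg_len)]
    norm = sum(w * w for w in window)
    psd = [0.0] * seg_len
    count = 0
    for start in range(0, len(x) - seg_len + 1, hop):
        seg = [x[start + n] * window[n] for n in range(seg_len)]
        for k in range(seg_len):
            acc = sum(seg[n] * cmath.exp(-2j * math.pi * k * n / seg_len)
                      for n in range(seg_len))
            psd[k] += abs(acc) ** 2 / norm
        count += 1
    if count == 0:
        raise ValueError("signal shorter than one segment")
    # Average the periodograms of all segments.
    return [p / count for p in psd]
```

Averaging over segments reduces the variance of the estimate per frequency sample, which matters here because the modeling procedure described further on is sensitive to outliers in the spectrum.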

Auto-regressive and moving average parameters can be represented in
different ways by e.g. polynomials, zeros of the polynomials (together with a gain factor),
reflection coefficients or log(Area) ratios. In an audio coding application, representation of
the auto-regressive and moving average parameters is preferably in log(Area) ratios. The
auto-regressive and moving average parameters that are determined in the ARMA modeling
according to the invention are combined to obtain the filter parameters that are transmitted.
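The relation between two of the representations listed above can be sketched as follows. This is a minimal example assuming the common definition of a log(Area) ratio as ln((1 + k)/(1 - k)) for a reflection coefficient k with |k| < 1; sign conventions differ between texts, and the function names are hypothetical.

```python
import math

def reflection_to_lar(k):
    """Map reflection coefficients (|k| < 1) to log(Area) ratios."""
    return [math.log((1.0 + ki) / (1.0 - ki)) for ki in k]

def lar_to_reflection(g):
    """Inverse map; tanh(g/2) equals (e^g - 1) / (e^g + 1)."""
    return [math.tanh(gi / 2.0) for gi in g]
```

The mapping stretches values near +1 and -1, so a uniform quantizer on the log(Area) ratios spends more resolution on reflection coefficients close to the stability limit.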

WO 97/28527 discloses the enhancement of speech parameters by determining
a background noise PSD estimate, determining noisy speech parameters, determining a noisy
speech PSD estimate from the speech parameters, subtracting a background noise PSD
estimate from the noisy speech PSD estimate, and estimating enhanced speech parameters
from the enhanced speech PSD estimate. The enhanced parameters may be used for filtering
noisy speech in order to suppress the noise or be used directly as speech parameters in speech
encoding. An estimate of the PSD is obtained by an auto-regressive model. It is noted in this
document that such an estimate is not a statistically consistent one, but that in speech signal
processing that is not a serious problem.

US-A 5,943,429 discloses a spectral subtraction noise suppression method in a
frame based digital communication system. The method is performed by a spectral
subtraction function which is based on an estimate of the power spectral density of
background noise of non-speech frames and an estimate of the power spectral density of
speech frames. Each speech frame is approximated by a parametric model that reduces the
number of degrees of freedom. The estimate of the power spectral density of each speech
frame is estimated from the approximative parametric model. Also in this case, the
parametric model is an AR model.

US-A 4,188,667 discloses an ARMA filter and a method for obtaining the
parameters for such filter. The first step of this method involves performing an inverse
discrete Fourier transform of the arbitrary selected frequency spectrum of amplitude to obtain
a truncated sequence of coefficients of a stable pure moving-average filter model, i.e. the
parameters of a non-recursive filter model. The truncated sequence of coefficients, which has
N+1 terms, is then convolved with a random sequence to obtain an output associated with the
random sequence. A time-domain, convergent parameter identification is then performed, in
a manner that minimizes an integral error function norm, to obtain the near minimum order
auto-regressive and moving-average parameters of the model having the desired amplitude-
and phase-frequency responses. The parameters are identified off-line. The object of this
embodiment is to provide a minimum or near minimum stable ARMA filter. The parameters
are determined in a batch filter program.

In general, estimating a power spectral density function differs from
characterizing a linear system in that, inter alia, in such characterization, the input and output
signals are available and used, whereas in estimating a power spectral density function, only
the power spectral density function is available (not an associated input signal).

The aforementioned and other aspects of the invention will be apparent from
and elucidated with reference to the embodiments described hereinafter.



In the drawings:

Fig. 1 shows an illustrative embodiment comprising an audio encoder
according to the invention;

Fig. 2 shows an illustrative embodiment comprising an audio player according
to the invention;

Fig. 3 shows an illustrative embodiment of an audio system according to the
invention;

Fig. 4 shows an exemplary mapping function m; and

Fig. 5 shows an embodiment of a noise suppression device in accordance with
the invention.

The drawings only show those elements that are necessary to understand the
invention.

The invention is preferably applied in audio and speech coding schemes in
which synthetic noise generation is employed. Typically, the audio signal is coded on a frame
to frame basis. The power spectral density function (or a possibly non-uniform sampled
version thereof) of the noise in a frame is estimated and a best approximation of the function
from a set of squared amplitude responses of a certain class of filters is found. In one
embodiment of the invention, an iterative procedure is used to estimate an ARMA model
based on existing low-complexity techniques for fitting AR and MA models to the power
spectral density function.

Fig. 1 shows an exemplary audio encoder 2 according to the invention. An
audio signal A is obtained from an audio source 1, such as a microphone, a storage medium, a
network etc. The audio signal A is input to the audio encoder 2. The audio signal A is
parametrically modeled in the audio encoder 2 on a frame to frame basis. A coding unit 20
comprises an analysis unit (AU) 200 and a synthesis unit (SU) 201. The AU 200 performs an
analysis of the audio signal and determines basic waveforms in the audio signal A. Further,
the AU 200 produces waveform parameters or coefficients C_i to represent the basic
waveforms. The waveform parameters C_i are furnished to the SU 201 to obtain a
reconstructed audio signal, which consists of synthesized basic waveforms. This
reconstructed audio signal is furnished to a subtracter 21 to be subtracted from the original
audio signal A. This rest signal S is regarded as a noise component of the audio signal A. In a
preferred embodiment, the coding unit 20 comprises two stages: one that performs transient
modeling, and another that performs sinusoidal modeling on the audio signal after subtraction
of the modeled transient components.

According to an aspect of the invention, the power spectral density function of
the noise component S in the audio signal A is ARMA modeled resulting in auto-regressive
parameters p_i and moving-average parameters q_i. The spectrum of the noise component S is
modeled according to the invention in noise analyzer (NA) 22 to obtain filter parameters
(p_i,q_i). The estimation of the parameters (p_i,q_i) is performed by determining filter parameters
of a filter in NA 22 which has a transfer function H^{-1} that makes the function S after filtering,
i.e. H^{-1}(S), spectrally as flat as possible, i.e. 'whitening the frequency spectrum'. In a decoder,
a reconstructed noise component can be generated which has approximately the same
properties as the noise component S by filtering white noise with a filter with transfer
function H that is opposite to the filter used in the encoder. The filtering operation of this
opposite filter is determined by the ARMA parameters p_i and q_i. The filter parameters (p_i,q_i)
are included together with the waveform parameters C_i in an encoded audio signal A' in
multiplexer 23. The audio stream A' is furnished from the audio encoder to an audio player
over a communication channel 3, which may be a wireless connection, a data bus or a storage
medium, etc.

An embodiment comprising an audio player 4 according to the invention is
shown in Fig. 2. An audio signal A' is obtained from the communication channel 3 and
de-multiplexed in de-multiplexer 40 to obtain the parameters (p_i,q_i) and the waveform
parameters C_i that are included in the encoded audio signal A'. The parameters (p_i,q_i) are
furnished to a noise synthesizer (NS) 41. The NS 41 is mainly a filter with a transfer function
H. A white noise signal y is input to the NS 41. The filtering operation of the NS 41 is
determined by the ARMA parameters (p_i,q_i). By filtering the white noise y with the NS 41,
which is opposite to the filter (NA) 22 used in the encoder 2, a noise component S' is generated
which has approximately the same stochastic properties as the noise component S in the
original audio signal A. The noise component S' is added in adder 43 to other reconstructed
components, which are e.g. obtained from a synthesis unit (SU) 42, to obtain a reconstructed
audio signal (A''). The SU 42 is similar to the SU 201. The reconstructed audio signal A'' is
furnished to an output 5, which may be a loudspeaker, etc.
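The relation between the synthesis filter H in the player and the opposite, whitening filter in the encoder can be illustrated with a small sketch. The direct-form routine `arma_filter` and the first-order coefficients below are hypothetical choices for this example and are not taken from the document; both polynomials are assumed monic with roots inside the unit circle, so that H and its inverse are stable.

```python
import random

def arma_filter(b, a, x):
    """Direct-form filter y[n] = sum_k b[k] x[n-k] - sum_{k>=1} a[k] y[n-k].

    b and a hold the coefficients of B(z) and A(z) in powers of z^-1,
    with a[0] == 1, so the transfer function is H = B / A.
    """
    y = []
    for n in range(len(x)):
        acc = sum(b[k] * x[n - k] for k in range(len(b)) if n - k >= 0)
        acc -= sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0)
        y.append(acc)
    return y

# Hypothetical first-order example: one pole at 0.9, one zero at -0.5.
a = [1.0, -0.9]   # A(z) = 1 - 0.9 z^-1
b = [1.0, 0.5]    # B(z) = 1 + 0.5 z^-1

rng = random.Random(0)
white = [rng.gauss(0.0, 1.0) for _ in range(200)]
noise = arma_filter(b, a, white)      # player side: H = B / A shapes white noise
whitened = arma_filter(a, b, noise)   # encoder side: A / B flattens it again
```

Swapping the roles of the two coefficient lists is all that distinguishes the shaping filter from the whitening filter, which is why transmitting one set of ARMA parameters suffices for both ends.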

Fig. 3 shows an audio system according to the invention comprising an audio
encoder 2 as shown in Fig. 1 and an audio player 4 as shown in Fig. 2. Such a system offers
playing and recording features. The communication channel 3 may be part of the audio
system, but will often be outside the audio system. In case the communication channel 3 is a
storage medium, the storage medium may be fixed in the system or be a removable disc,
memory stick, tape etc.

Below, the modeling of the spectrum of S is further described. Suppose S is a
power spectral density function of a discrete-time real-valued signal. Further, S is a
real-valued function defined on the interval I = (-π, π). S is assumed to be symmetric with
min(S) > 0 and max(S) < ∞. For convenience, it is assumed that the logarithmic mean of S
equals zero, i.e.

$$\frac{1}{2\pi}\int_{I} \ln S(\theta)\, d\theta = 0 \tag{1}$$

The extension to cases with a mean on the log scale unequal to zero is straightforward, but
can be handled in various ways. Note that S can be derived from samples of an actually
measured power spectral density function by suitable interpolation and normalization.
Let H be a rational transfer function according to H = B/A with

$$A = \prod_{i} (1 - z^{-1} p_i) \quad\text{and}\quad B = \prod_{i} (1 - z^{-1} q_i).$$

Here, p_i and q_i are the poles and the zeros of the transfer function H, respectively. Note
that the logarithmic mean of $|H|^2$ also equals zero.

The target function is approximated by the squared modulus of H, i.e.

$$S \approx |H|^2.$$

A measure for the correctness of the approximation is introduced by:

$$J = \frac{1}{4\pi}\int_{I} \left|\ln S - \ln |H|^2\right|^2 d\theta \tag{2}$$

The criterion (2) can be rewritten to

$$J = \frac{1}{2\pi}\int_{I} \left\{ \ln\frac{S}{|H|^2} + \frac{1}{2}\ln^2\frac{S}{|H|^2} \right\} d\theta \tag{3}$$

in view of the fact that both S and $|H|^2$ have a logarithmic mean equal to zero. If,
furthermore, $S/|H|^2 \approx 1$ for each θ, the criterion (2) is approximated by $\tilde{J} - 1$, where

$$\tilde{J} = \frac{1}{2\pi}\int_{I} \frac{S}{|H|^2}\, d\theta \tag{4}$$

This means that in the neighborhood of the optimal solution, the criteria (2) and (4) are
practically equal.
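The approximation step can be made explicit with a short worked expansion. This is a sketch under stated assumptions: $x(\theta) = \ln(S/|H|^2)$ is a symbol introduced here, $J$ is taken as $\frac{1}{4\pi}\int_I x^2\, d\theta$ and $\tilde{J}$ as $\frac{1}{2\pi}\int_I S/|H|^2\, d\theta$, and $x$ has zero mean because both S and $|H|^2$ have a logarithmic mean of zero.

```latex
\begin{aligned}
\frac{S}{|H|^2} &= e^{x} = 1 + x + \tfrac{1}{2}x^{2} + O(x^{3}), \\
\tilde{J} - 1 &= \frac{1}{2\pi}\int_{I} \left(e^{x} - 1\right) d\theta
  \approx \underbrace{\frac{1}{2\pi}\int_{I} x \, d\theta}_{=\,0}
  + \frac{1}{4\pi}\int_{I} x^{2}\, d\theta = J .
\end{aligned}
```

The neglected terms are of third order in $x$, so the two criteria agree closely only where $S/|H|^2$ stays near one over the whole interval.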

It is well known that in the case that H = 1/A (i.e. B = 1), (4) is associated with
Forward Linear Prediction (FLP), which is an example of an LPC method. Therefore, the
polynomial A can be found by calculating (or at least approximating) the auto-correlation
function associated with S and solving the Wiener-Hopf equations. The qualitative results of
such a procedure are also well known. The above sketched procedure will give good
approximations to the peaks of S (when measured or visualized on a logarithmic scale) but
usually provides only poor fits to the valleys of S. To conclude the above, a standard
procedure is available for estimating an all-pole model from the power spectral density
function, which provides an approximation to the optimal solution with (2) and which
basically is good at modeling the peaks of S.
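The standard all-pole procedure can be sketched as follows: the auto-correlation is approximated from samples of S on a uniform frequency grid, and the Wiener-Hopf (normal) equations are solved with the Levinson-Durbin recursion. The function names, the grid size and the first-order test spectrum are assumptions made for this illustration; the document does not prescribe a particular solver.

```python
import cmath
import math

def autocorr_from_psd(S, order):
    """Approximate r[k] = (1/2pi) * integral of S(theta) cos(k theta) on a uniform grid."""
    n = len(S)
    return [sum(S[i] * math.cos(2 * math.pi * i * k / n) for i in range(n)) / n
            for k in range(order + 1)]

def levinson(r):
    """Solve the normal equations; returns A = [1, a1, ..., ap] and the error power."""
    a = [1.0]
    err = r[0]
    for m in range(1, len(r)):
        acc = sum(a[j] * r[m - j] for j in range(m))
        k = -acc / err                      # reflection coefficient of stage m
        a = a + [0.0]
        a = [a[j] + k * a[m - j] for j in range(m + 1)]
        err *= 1.0 - k * k
    return a, err

# Hypothetical test spectrum: S = 1 / |1 - 0.8 e^{-j theta}|^2 on a 512-point grid.
n = 512
S = [1.0 / abs(1.0 - 0.8 * cmath.exp(-2j * math.pi * i / n)) ** 2 for i in range(n)]
A, err = levinson(autocorr_from_psd(S, order=2))
```

Because the test spectrum is exactly all-pole of order one, the second-order fit is expected to return a second coefficient close to zero.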

It is noted that peaks and valleys of ln S have essentially the same
characteristic except for a reversal of sign: a peak is a positive excursion, whereas a trough is
a negative one. Consequently, taking $\tilde{S} = 1/S$, an all-zero model can be estimated by using
the above sketched procedure for an all-pole model. From the result of this procedure, a good
fit to the valleys of S is expected, but only poor or at most fair fits to the peaks of S.

An object of the invention is to provide a good representation of S for both the
peaks and the valleys. In an embodiment of the invention, an ARMA model is provided in
which all-pole modeling and all-zero modeling are combined in the following way. S is split
in two parts as $S = S_A / S_B$. From $S_A$ an all-pole model is estimated yielding the polynomial A
and from $S_B$ an all-zero model is estimated yielding the polynomial B. The combination
$|H|^2 = |B|^2 / |A|^2$ is considered an approximation of S.

According to a preferred aspect of the invention the split of S is performed in
an iterative process. The iteration step is called l. At each step of the iteration, a new split $S_{A,l}$
and $S_{B,l}$ is generated and the corresponding estimates $A_l$ and $B_l$ are calculated. A given
subdivision of S in $S_A$ and $S_B$ is used to start with and thereafter parts of $S_B$ that are not
modeled accurately are attributed to $S_A$ and vice versa. At step l-1 in the iterative scheme,
$H_{l-1} = B_{l-1} / A_{l-1}$. Hereafter, the partial functions $S_{A,l} = S / |B_{l-1}|^2$ and
$S_{B,l} = 1 / (S\,|A_{l-1}|^2)$ are considered. In this way, from S those parts that can be modeled
accurately by the all-pole model are excluded from contributing to $S_B$. Similarly, those parts
of S that could be modeled by an all-zero filter are excluded from $S_A$. From $S_{A,l}$ and $S_{B,l}$
the new estimates $\hat{A}_l$ and $\hat{B}_l$ are calculated. In this way, parts which in the previous
iteration could not be modeled appropriately are swapped.

For a next step, preferably, the following four possible combinations of previous and new
polynomials are considered:

$$G_0 = \frac{B_{l-1}}{A_{l-1}}, \quad G_1 = \frac{\hat{B}_l}{A_{l-1}}, \quad G_2 = \frac{B_{l-1}}{\hat{A}_l}, \quad G_3 = \frac{\hat{B}_l}{\hat{A}_l}$$

where $\hat{A}_l$ and $\hat{B}_l$ denote the polynomials newly estimated from $S_{A,l}$ and $S_{B,l}$.
The best fit to S of these four candidate filters is defined as the one with minimum error; the
associated filter is the final result of step l. Preferably, $H_l$ (and thus $A_l$ and $B_l$) is selected as
the best of the candidates $G_i$ with i = 0,1,2,3 on a logarithmic criterion according to

$$H_l = \arg\min_{G_i} \frac{1}{4\pi}\int_{I} \left|\ln S - \ln |G_i|^2\right|^2 d\theta \tag{5}$$

From here, the procedure is continued with step l + 1, by taking $S_{A,l+1} = S / |B_l|^2$ and
$S_{B,l+1} = 1 / (S\,|A_l|^2)$.

Any common stop procedure can be used, e.g. a maximum number of
iterations, a sufficient accuracy of the current estimate, or insufficient progress in going from
one step to another.
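The iterative scheme can be sketched as follows, under several assumptions that are not taken from the document: S is sampled on a uniform grid and has zero logarithmic mean, the all-pole fit is a Levinson-Durbin recursion on the auto-correlation derived from the samples, the first guess is the better of a pure all-pole and a pure all-zero model, and the stop rule is a fixed iteration cap combined with insufficient progress. All function names are hypothetical.

```python
import cmath
import math

def fit_all_pole(S, order):
    """LPC fit: auto-correlation from PSD samples, then Levinson-Durbin. Returns monic A."""
    n = len(S)
    r = [sum(S[i] * math.cos(2 * math.pi * i * k / n) for i in range(n)) / n
         for k in range(order + 1)]
    a, err = [1.0], r[0]
    for m in range(1, order + 1):
        k = -sum(a[j] * r[m - j] for j in range(m)) / err
        a = a + [0.0]
        a = [a[j] + k * a[m - j] for j in range(m + 1)]
        err *= 1.0 - k * k
    return a

def power_response(c, n):
    """|C(e^{j theta})|^2 on the n-point grid theta_i = 2 pi i / n."""
    return [abs(sum(cj * cmath.exp(-2j * math.pi * i * j / n)
                    for j, cj in enumerate(c))) ** 2 for i in range(n)]

def log_error(S, b, a):
    """Mean squared difference between ln S and ln(|B|^2 / |A|^2)."""
    n = len(S)
    B2, A2 = power_response(b, n), power_response(a, n)
    return sum((math.log(S[i]) - math.log(B2[i] / A2[i])) ** 2 for i in range(n)) / n

def fit_arma(S, p, q, max_iter=10):
    n = len(S)
    # First guess: pure all-pole (S_A = S) versus pure all-zero (1/S_B = S).
    cands = [([1.0], fit_all_pole(S, p)),
             (fit_all_pole([1.0 / s for s in S], q), [1.0])]
    b, a = min(cands, key=lambda ba: log_error(S, ba[0], ba[1]))
    best = log_error(S, b, a)
    for _ in range(max_iter):
        B2, A2 = power_response(b, n), power_response(a, n)
        S_A = [S[i] / B2[i] for i in range(n)]          # S / |B|^2
        S_B = [1.0 / (S[i] * A2[i]) for i in range(n)]  # 1 / (S |A|^2)
        a_new, b_new = fit_all_pole(S_A, p), fit_all_pole(S_B, q)
        # Four combinations of previous and new polynomials.
        cands = [(b, a), (b_new, a), (b, a_new), (b_new, a_new)]
        b, a = min(cands, key=lambda ba: log_error(S, ba[0], ba[1]))
        err = log_error(S, b, a)
        converged = best - err < 1e-12   # insufficient progress
        best = err
        if converged:
            break
    return b, a, best

# Hypothetical target: one zero at -0.5 and one pole at 0.8, sampled at 64 points.
n = 64
S = [x / y for x, y in zip(power_response([1.0, 0.5], n), power_response([1.0, -0.8], n))]
b, a, err = fit_arma(S, 1, 1)
```

Because the previous pair is always among the candidates, the logarithmic error cannot increase from one step to the next in this sketch; whether it reaches the global optimum is not guaranteed.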

A slightly different procedure performs the AR and MA modeling alternately.
If the previous step returned a refined estimate of the numerator $B_{l-1}$, then take

$$S_{A,l} = S / |B_{l-1}|^2$$

and calculate $A_l$. $B_l$ is taken as $B_{l-1}$.
If the previous step returned a refined estimate of the denominator $A_{l-1}$, then take

$$S_{B,l} = 1 / (S\,|A_{l-1}|^2)$$

and calculate $B_l$. $A_l$ is taken as $A_{l-1}$.

From $A_l$ and $B_l$, $H_l$ is constructed and the error evaluated (e.g. a mean squared difference on a
log scale).

There are many alternatives to initialize the iterative scheme. Without
limitation, the following possibilities are mentioned:

First, a simple way of initializing is provided by taking $S_{A,0} = S$ and $S_{B,0} = 1$,
and $S_{A,0} = 1$ and $1/S_{B,0} = S$. Next, $A_0$ and $B_0$ are calculated. From these two initial estimates, a
best fit (according to some criterion) is chosen. In this way, the first guess is either an all-pole
or an all-zero model.

Second, S may be split in equal parts according to $S_{A,0} = 1/S_{B,0} = \sqrt{S}$.

Third, since $S_A$ should contain the peaks and $S_B$ the valleys, a favorable split is
to attribute everything above a mean logarithmic level (e.g. above zero) to $S_{A,0}$ and anything
below said level to $S_{B,0}$. This division may be made at the global logarithmic mean, but also
at some local logarithmic mean.

Fourth, a further splitting process takes into account that in power spectral
density functions on a logarithmic scale, poles and zeros close to the unit circle give rise to
pronounced peaks and valleys, respectively. The data S is split on the notion that peaks and
valleys in log S are more appropriately handled by the all-pole and all-zero model,
respectively. Define:

P = log S
P_A = log S_A
P_B = log(1/S_B)

Consider the mapping function m with m: ℝ → [-1, 1]. The mapping function will typically
be a non-decreasing, point-symmetric sigmoidal function in view of the symmetry of pole
and zero behavior on a log scale. However, non-symmetric functions can be used as well and
have the effect of giving more weight to either the pole or the zero modeling. An exemplary
mapping function m is shown in Fig. 4.
Consider the following initial split:

$$P_A = \frac{1 + m(P)}{2}\, P$$

$$P_B = \frac{1 - m(P)}{2}\, P$$

In this way, positive excursions (peaks) of P are predominantly attributed to $P_A$ and,
consequently, modeled by the all-pole filter. Negative excursions (valleys) of P are mostly
attributed to $P_B$ and, consequently, modeled by the all-zero filter. From $P_A$ and $P_B$, $S_A$ and $S_B$
are constructed and, next, $A_0$ and $B_0$ are calculated.

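There are two limiting cases of m (which are similar to the second and the third initialization
as discussed above):

- m = 0, then $S_{A,0} = 1/S_{B,0} = \sqrt{S}$;

- m is a signum function:

$$m(x) = \begin{cases} -1, & x < 0 \\ 0, & x = 0 \\ 1, & x > 0 \end{cases}$$

In this case:

$$S_{A,0}(x) = \begin{cases} S(x), & S(x) > 1 \\ 1, & S(x) \le 1 \end{cases} \qquad 1/S_{B,0}(x) = \begin{cases} S(x), & S(x) < 1 \\ 1, & S(x) \ge 1 \end{cases}$$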
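The initial split by a mapping function can be sketched in a few lines. The sketch assumes the split takes the form P_A = (1 + m(P))/2 · P and P_B = (1 - m(P))/2 · P with P = P_A + P_B, which is consistent with the two limiting cases above; the choice of tanh as sigmoidal mapping and the function names are hypothetical.

```python
import math

def split_log_spectrum(P, m=math.tanh):
    """Return (P_A, P_B) with P = P_A + P_B for a mapping m into [-1, 1]."""
    P_A = [0.5 * (1.0 + m(p)) * p for p in P]   # mostly the peaks (P > 0)
    P_B = [0.5 * (1.0 - m(p)) * p for p in P]   # mostly the valleys (P < 0)
    return P_A, P_B

def signum(x):
    """Limiting case: hard split at the zero log level."""
    return (x > 0) - (x < 0)

# Hypothetical log-spectrum samples with zero-level crossings.
P = [-2.0, -0.1, 0.0, 0.3, 1.5]
soft_A, soft_B = split_log_spectrum(P)
hard_A, hard_B = split_log_spectrum(P, signum)
```

With the signum mapping every sample goes entirely to one part, whereas a smooth sigmoid shares samples near the zero level between both parts.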



The proposed spectrum modeling is very apt at modeling peaks and valleys
since, basically, these constitute the patterns generated by the degrees of freedom offered by
the poles and zeros. Consequently, the procedure is sensitive to outliers: rather than being
smoothed out, these will appear in the approximation. Therefore, the input data S has to be an
accurate estimate (in the sense of a small ratio of standard deviation and mean per frequency
sample) or S must be pre-processed (e.g. smoothed) in order to suppress undesired modeling
of outliers. This observation holds especially if the number of degrees of freedom in the
model is relatively large with respect to the number of data points on which the power
spectral density function is based.
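One possible pre-processing step is a moving average of the log spectrum, sketched below. The window shape, its width and the function name are assumptions for illustration; the document only states that S may be smoothed.

```python
import math

def smooth_log_spectrum(S, half_width=2):
    """Circular moving average of ln S; returns the smoothed spectrum."""
    n = len(S)
    logs = [math.log(s) for s in S]
    width = 2 * half_width + 1
    return [math.exp(sum(logs[(i + d) % n]
                         for d in range(-half_width, half_width + 1)) / width)
            for i in range(n)]

# Hypothetical flat spectrum with a single outlier.
S = [1.0] * 16
S[5] = 100.0
smoothed = smooth_log_spectrum(S)
```

Averaging on the log scale with a circular window leaves the logarithmic mean of S unchanged, so a spectrum normalized to zero logarithmic mean stays normalized.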

Convergence cannot be established without knowledge of the actual
optimization steps A and B and the selection criterion. It is not guaranteed that the error
reduces at every step in the iteration process.

In many cases, it is desired to have a good approximation of the power spectral
density function on a logarithmically scaled frequency axis. For example, it is common practice
to evaluate the result of a fit on a spectrum visually in the form of a Bode plot. Similarly, for
audio and speech applications, the preferred scale would be a Bark or Equivalent Rectangular
Bandwidth (ERB) scale, which is more or less a logarithmic scale. The method according to
the invention is suitable for frequency-warped modeling. The spectral density measurements
can be calculated on any frequency grid whatsoever. Under the condition that the frequency
warping is close to that of a first-order all-pass section, this can be re-wrapped while
maintaining the order of the ARMA model.
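The frequency map of a first-order all-pass section can be sketched as follows. The formula is the phase response of the section (z^-1 - λ)/(1 - λ z^-1); the coefficient name `lam`, the function names and the grid construction are assumptions of this example, and the document does not give a value for the coefficient.

```python
import math

def allpass_warp(theta, lam):
    """Frequency map of a first-order all-pass section with coefficient lam (|lam| < 1)."""
    return theta + 2.0 * math.atan2(lam * math.sin(theta), 1.0 - lam * math.cos(theta))

def warped_grid(n, lam):
    """Uniform grid on the warped axis, mapped back to ordinary frequencies in [0, pi]."""
    return [allpass_warp(math.pi * i / (n - 1), -lam) for i in range(n)]

# Hypothetical coefficient; positive values stretch the low-frequency region.
grid = warped_grid(9, 0.5)
```

Sampling the power spectral density on such a grid and fitting the model there corresponds to modeling on the warped axis; the inverse map uses the same formula with the sign of the coefficient reversed.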

Application areas of the invention include audio coding, buried data
techniques, noise shaping and fast filter design. A further exemplary embodiment of the
invention is shown in Fig. 5. In Fig. 5 an audio signal A is obtained from a source 1 in a
similar way as in Fig. 1. The audio signal A is processed in a noise-suppression device 6. The
noise-suppression device comprises a noise analyzer (NA) 60 and a noise synthesizer (NS)
61. In this embodiment, the NA 60 directly analyzes noise in the audio signal. A spectrum of
the noise is modeled by determining ARMA parameters (p_i,q_i) according to the invention.
The NS 61, which is mainly a filter, has a frequency response approximating the spectrum of
the noise. The NS 61 generates reconstructed noise by filtering a white noise y, wherein the
filtering properties of NS 61 are determined by the ARMA parameters (p_i,q_i). In an adder 62,
the reconstructed noise is subtracted from the audio signal (A) to obtain a noise-filtered audio
signal ({A}). Preferably, the noise spectrum is modeled in one or more (previous) frames
that, besides noise, do not contain much signal, e.g. speech-free frames in speech coding. The
reconstructed noise can be subtracted in frames that do contain more signal, e.g. speech
frames in speech coding.

It should be noted that the above-mentioned embodiments illustrate rather than
limit the invention, and that those skilled in the art will be able to design many alternative
embodiments without departing from the scope of the appended claims. In the claims, any
reference signs placed between parentheses shall not be construed as limiting the claim. The
word 'comprising' does not exclude the presence of other elements or steps than those listed
in a claim. The invention can be implemented by means of hardware comprising several
distinct elements, and by means of a suitably programmed computer. In a device claim
enumerating several means, several of these means can be embodied by one and the same
item of hardware. The mere fact that certain measures are recited in mutually different
dependent claims does not indicate that a combination of these measures cannot be used to
advantage.

In summary, modeling a target spectrum is provided by determining filter
parameters of a filter which has a frequency response approximating the target spectrum,
wherein the target spectrum is split in at least a first part and a second part, a first modeling
operation is used on the first part of the target spectrum to obtain auto-regressive parameters,
a second modeling operation is used on the second part of the target spectrum to obtain
moving-average parameters, and the auto-regressive parameters and the moving-average
parameters are combined to obtain the filter parameters. The invention is preferably applied
in audio coding, wherein a spectrum of a noise component in the signal is modeled.

A model for fast ARMA estimation from power spectral density data has been
explained. It uses e.g. FLP techniques for the estimation of the numerator and the
denominator polynomials and an iterative procedure to produce the most appropriate split in
the power spectral density data to attribute parts of the data to the all-pole model and other
parts to the all-zero model.

1. A method of modeling (2, 22) a target spectrum (S) by determining filter
parameters (p_i,q_i) of a filter (41) which has a frequency response (S') approximating the
target spectrum (S),

characterized in that the method comprises the steps of:

splitting (22) the target spectrum in at least a first part and a second part;

using (22) a first modeling operation on the first part of the target spectrum (S)
to obtain auto-regressive parameters (p_i);

using (22) a second modeling operation on the second part of the target
spectrum to obtain moving-average parameters (q_i); and

combining (22) the auto-regressive parameters (p_i) and the moving-average
parameters (q_i) to obtain the filter parameters (p_i,q_i).

2. A method as claimed in claim 1, wherein the second modeling operation (22) 
comprises the step of: 

using the first modeling operation on a reciprocal of the second part of the 
target spectrum. 

3. A method as claimed in claim 1, wherein the step of splitting (21) comprises:
taking an initial split in an initial first part and an initial second part; and
using an iterative procedure to obtain a better split than the initial split until
some stop criterion is met.

4. A method as claimed in claim 3, wherein the iterative procedure comprises:
using a first modeling operation on a first part of a previous split to obtain new
auto-regressive parameters;

using a second modeling operation on a second part of a previous split to
obtain new moving-average parameters; and

re-attributing parts of the first part of the previous split that could not be
modeled accurately by the first modeling operation to the second part of the previous split,
and parts of the second part of the previous split that could not be modeled accurately by the
second modeling operation to the first part of the previous split to obtain a new split.

5. A method as claimed in claim 4, wherein the step of re-attributing comprises:

dividing the first part of the previous split by an estimate of the target
spectrum based on moving-average parameters; and

dividing the second part of the previous split by an estimate of the target
spectrum based on auto-regressive parameters.

6. A method as claimed in claim 2, wherein the initial first part comprises at least
a significant part of the target spectrum above a mean logarithmic level and the initial second
part comprises at least a significant part below said level.

7. A method as claimed in claim 2, wherein the initial split is determined by: 

^ 2 

" 2 
where: 

P = log(the target spectrum) 
Pa = log(the first part of the target spectrum) 
Pb = log(the second part of the target spectrum) 
and m is a mapping function with m: ℝ → [-1,1]. 



8. A device (2), comprising: 

means (22) for determining filter parameters (pi,qi) of a filter (41) which has a 
frequency response (S') approximating a target spectrum, 

characterized in that the device further comprises: 

means (22) for splitting the target spectrum (S) in at least a first part and a 
second part; 

means (22) for using a first modeling operation on the first part of the target 
spectrum (S) to obtain auto-regressive parameters (pi); 

means (22) for using a second modeling operation on the second part of the 
target spectrum (S) to obtain moving-average parameters (qi); and 

means (22) for combining the auto-regressive parameters (pi) and the 
moving-average parameters (qi) to obtain the filter parameters (pi,qi). 

9. A method of suppressing noise (6) in an audio signal (A), the method 
comprising: 

modeling (60) a spectrum of the noise by determining filter parameters (pi,qi) 
of a filter (61) which has a frequency response approximating the spectrum of the noise; 

obtaining (61) reconstructed noise by filtering (61) a white noise (y) with a 
filter (61), which properties are determined by the filter parameters (pi,qi); and 

subtracting (62) the reconstructed noise from the audio signal (A) to obtain a 
noise-filtered audio signal ({A}); 

the step of modeling (60) comprising: 

splitting (60) the spectrum in at least a first part and a second part; 

using (60) a first modeling operation on the first part of the spectrum to obtain 
auto-regressive parameters (pi); 

using (60) a second modeling operation on the second part of the noise 
spectrum to obtain moving-average parameters (qi); and 

combining (60) the auto-regressive parameters (pi) and the moving-average 
parameters (qi) to obtain the filter parameters (pi,qi). 


10. A device (6) for suppressing noise in an audio signal (A), the device 
comprising: 

means (60) for modeling a spectrum of the noise by determining filter 
parameters (pi,qi) of a filter (61) which has a frequency response approximating the spectrum 
of the noise; 

means (61) for obtaining reconstructed noise by filtering (61) a white noise (y) 
with a filter (61), which properties are determined by the filter parameters (pi,qi); and 

means (62) for subtracting the reconstructed noise from the audio signal (A) to 
obtain a noise-filtered audio signal ({A}); 

the means for modeling (60) comprising: 

means (60) for splitting the spectrum in at least a first part and a second part; 

means (60) for using a first modeling operation on the first part of the 
spectrum to obtain auto-regressive parameters (pi); 

means (60) for using a second modeling operation on the second part of the 
noise spectrum to obtain moving-average parameters (qi); and 

means (60) for combining the auto-regressive parameters (pi) and the 
moving-average parameters (qi) to obtain the filter parameters (pi,qi). 


11. A method of encoding (2,21) an audio signal (A), comprising the steps of: 

determining (200) basic waveforms in the audio signal (A); 

obtaining (21) a noise component (S) from the audio signal (A) by subtracting 
the basic waveforms from the audio signal (A); 

modeling (22) a spectrum of the noise component (S) by determining filter 
parameters (pi,qi) of a filter (41) which has a frequency response (S') approximating the 
spectrum of the noise component (S); and 

including (23) the filter parameters (pi,qi) and waveform parameters (Ci) 
representing the basic waveforms in an encoded audio signal (A'); 

the step of modeling comprising: 

splitting (22) the spectrum (S) in at least a first part and a second part; 

using (22) a first modeling operation on the first part of the spectrum (S) to 
obtain auto-regressive parameters (pi); 

using (22) a second modeling operation on the second part of the noise 
spectrum (S) to obtain moving-average parameters (qi); and 

combining (22) the auto-regressive parameters (pi) and the moving-average 
parameters (qi) to obtain the filter parameters (pi,qi). 



12. A method of decoding (4) an encoded audio signal (A'), comprising the steps 
of: 

receiving (40) an encoded audio signal (A') comprising waveform parameters 
(Ci) representing basic waveforms and filter parameters (pi,qi), the filter parameters (pi,qi) 
being a combination of auto-regressive parameters (pi) and moving-average parameters (qi) 
as acquired in accordance with the method of claim 11; 

filtering (41) a white noise signal (y) to obtain a reconstructed noise 
component (S'), which filtering is determined by the filter parameters (pi,qi); 

synthesizing (42) basic waveforms based on the waveform parameters (Ci); 
and 

adding (43) the reconstructed noise component (S') to the synthesized basic 
waveforms to obtain a decoded audio signal (A''). 
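
The decoding steps above can be sketched as follows, under stated assumptions: the basic waveforms are taken to be stationary sinusoids described by hypothetical (amplitude, frequency, phase) triples, the white noise is Gaussian, and the filter is applied as a direct-form difference equation with an implied leading coefficient of 1. None of these choices, nor the names `arma_filter` and `decode`, is fixed by the claims.

```python
import numpy as np

def arma_filter(p, q, x, gain=1.0):
    """Direct-form difference equation (assumed convention):
    y[n] = gain*(x[n] + sum_k q[k]*x[n-1-k]) - sum_k p[k]*y[n-1-k]."""
    x = np.asarray(x, dtype=float)
    y = np.zeros_like(x)
    for n in range(len(x)):
        acc = x[n]
        for k, qk in enumerate(q):      # moving-average (all-zero) part
            if n - 1 - k >= 0:
                acc += qk * x[n - 1 - k]
        acc *= gain
        for k, pk in enumerate(p):      # auto-regressive (all-pole) part
            if n - 1 - k >= 0:
                acc -= pk * y[n - 1 - k]
        y[n] = acc
    return y

def decode(p, q, sinusoids, n_samples, seed=0):
    """Add filtered white noise to a sum of sinusoids.
    sinusoids: iterable of (amplitude, radians_per_sample, phase)."""
    rng = np.random.default_rng(seed)
    white = rng.standard_normal(n_samples)   # white noise signal
    noise = arma_filter(p, q, white)         # reconstructed noise component
    n = np.arange(n_samples)
    waves = np.zeros(n_samples)
    for amp, omega, phase in sinusoids:      # synthesized basic waveforms
        waves += amp * np.cos(omega * n + phase)
    return waves + noise                     # decoded signal
```

The explicit loop keeps the sketch dependency-free; a library filter routine could replace `arma_filter` where one is available.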



13. An audio encoder (2) comprising: 

means (200) for determining basic waveforms in the audio signal (A); 

means (21) for obtaining a noise component (S) from the audio signal (A) by 
subtracting (21) the basic waveforms from the audio signal (A); 

means (22) for modeling a spectrum of the noise component (S) by 
determining filter parameters (pi,qi) of a filter (41) which has a frequency response (S') 
approximating the spectrum of the noise component (S); and 

means (23) for including the filter parameters (pi,qi) and waveform parameters 
(Ci) representing the basic waveforms in an encoded audio signal (A'); 

the means (22) for modeling comprising: 

means (22) for splitting the spectrum (S) in at least a first part and a second 
part; 

means (22) for using a first modeling operation on the first part of the 
spectrum (S) to obtain auto-regressive parameters (pi); 

means (22) for using a second modeling operation on the second part of the 
noise spectrum (S) to obtain moving-average parameters (qi); and 

means (22) for combining the auto-regressive parameters (pi) and the 
moving-average parameters (qi) to obtain the filter parameters (pi,qi). 

14. An audio player (4) comprising: 

means (40) for receiving an encoded audio signal (A') comprising waveform 
parameters (Ci) representing basic waveforms and filter parameters (pi,qi), the filter 
parameters (pi,qi) being a combination of auto-regressive parameters (pi) and moving-average 
parameters (qi) as acquired in accordance with the method of claim 11; 

means (41) for filtering a white noise signal (y) to obtain a reconstructed noise 
component (S'), which filtering is determined by the filter parameters (pi,qi); 

means (42) for synthesizing basic waveforms based on the waveform 
parameters (Ci); and 

means (43) for adding the reconstructed noise component (S') to the 
synthesized basic waveforms to obtain a decoded audio signal (A''). 

15. An audio system comprising an audio encoder (2) as claimed in claim 13 and 
an audio player (4) as claimed in claim 14. 

16. An encoded audio signal (A') comprising: 

waveform parameters (Ci) representing basic waveforms; and 

a spectrum of a noise component (S) represented by a combination of 
auto-regressive parameters (pi) and moving-average parameters (qi) as acquired in accordance with 
the method of claim 11. 

17. A storage medium (3) on which an encoded audio signal (A') as claimed in 
claim 16 is stored. 